ON THE VARIATIONAL PRINCIPLE FOR 
GENERALIZED GIBBS MEASURES 



AERNOUT VAN ENTER AND EVGENY VERBITSKIY 

o . 

Abstract. We present a novel approach to establishing the varia- 
tional principle for Gibbs and generalized (weak and almost) Gibbs 
states. Limitations of a thermodynamic formalism for generalized 
\ Gibbs states will be discussed. A new class of intuitively Gibbs 

measures is introduced, and a typical example is studied. Finally, 
we present a new example of a non-Gibbsian measure arising from 
an industrial application. 
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1. Introduction 

t-h ; 

Gibbs measures, denned as solutions of the Dobrushin-Lanford-Ruel- 
. le equations, can equivalently be denned as solutions of a variational 

principle (at least when they are translation invariant). 

Such a variational principle states that when we take as a base mea- 
sure a Gibbs measure for some potential, or more generally, for some 
specification, other Gibbs measures for the same potential (specifica- 
tion) are characterized by having a zero relative entropy density with 
respect to this base measure. 
^ ■ If the base measure is not a Gibbs measure, such a statement need 

not be true anymore. The construction of Xu provides an exam- 
ple of a "universal" ergodic base measure, such that all translation 
invariant measures have zero entropy density with respect to it. Even 
within the reasonably well-behaved class of "almost Gibbs" measures 
there are examples such that certain Dirac measures have zero entropy 
density with respect to it, for a similar weakly Gibbsian example 
see One can, however, to some extent circumvent this problem by 
requiring that both measures share sufficiently many configurations in 
their support. 

For almost Gibbs measures, the measure-one set of good (continuity) 
configurations have the property that they can shield off any influence 
from infinity. On the other hand for the strictly larger class of weakly 
Gibbsian measures, it may suffice that most, but not necessarily all, 
influences from infinity are blocked by the "good" configurations. 

l 
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The situation with respect to the variational principle between the 
class of almost Gibbs measures is much better than with respect to the 
class of weak Gibbs measures [TU] . 

On the one hand, one expects that a variational principle might 
hold beyond the class of almost Gibbs measures. For example, infinite- 
range unbounded-spin systems lack the almost Gibbs property (due 
to the fact that for a configuration of sufficiently increasing spins the 
interaction between the origin and infinity is never negligible, whatever 
happens in between), but a variational principle for such models has 
been found; on the other hand, the analysis of Kiilske for the random 
field Ising model implies that one really needs some extra conditions, 
or the variational principle can be violated. 

The paper is organised as follows. After recalling some basic notions 
and definitions, we discuss Goldstein's construction of a specification 
for an arbitrary translation invariant measure. In Section 3, we consider 
two measures v and \x such that h{y\\i) = and we formulate a sufficient 
condition for v to be consistent with a given specification 7 for \i. We 
also consider a general situation of h{y\\x) = and recover a result 
of Follmer. The new sufficient condition is clarified in the case of 
almost Gibbs measures in Section 4, and for a particular weak Gibbs 
measure in Section 5. We also introduce a new class of intuitively 
weak Gibbs measures. In Section 6, we present an example of a non- 
Gibbsian measure arising in industrial setting: magnetic and optical 
data storage. 

Acknowledgment. We are grateful to Frank den Hollander, Christ- 
of Kiilske, Jeff Steif, and Frank Redig for useful discussions. 



2.1. Notation. We work with spin systems on the lattice Z d , i.e., con- 
figurations are elements of the product space A zd , where A is a finite 
set (alphabet). The configuration space Q = A zd is endowed with the 
product topology, making it into a compact metric space. Configura- 
tions will denoted by lower-case Greek letters. The set of finite subsets 
of Z d is denoted by S. 

For A G S we put fi A = A A . For a G Q, and A G S, a A G Qa 
denotes the restriction of o to A. For a, r] in Q, A G S, cr A r]\c denotes 
the configuration coinciding with a on A, and rj on A c . For A C Z d , 
JF A denote the a-algbra generated by {a x \ x G A}. 

For two translation invariant probability measures \x and v, define 
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if v A is absolutely continuous with respect to fi A , and H A {y\ix) = +00, 
otherwise. 

The relative entropy density h(u\fj) is defined (provided the limit 
exists) as 

h{v\n) = lim -L-H^vlfj), 

n— >oo \A n \ 

where {A n } is a sequence of finite subsets Z d , with A n /* Z d as n — > 00 
in van Hove sense. For example, one can take A n = [— n, n] d . 

A potential U = {U(A, -)} AeS is a family of functions indexed by 
finite subsets of Z d with the property that U(A, oS) depends only on 
uj a . A Hamiltonian H A is defined by 

A'nA^0 

The Hamiltonian H A is said to be convergent in a if the sum on the 
right hand side is convergent. The Gibbs specification 7^ = {7A}Aes 
is defined by 

TaM^aO = -, y 

Y expf -H)[(u) A cr A c) J 

provided H A is convergent in every point uj A a Ac , uj a G VL A . Formally, 
if H A is not convergent in every point u G f2, 7 17 is not a specification 
in the sense of standard Definition 12.11 Nevertheless, in many cases 
(e.g., weakly Gibbsian measures, see Definition 15. II below) 7^ can still 
be viewed as a version of conditional probabilities for [i: 

\i{u A \o A *) = j A (u A \a A c) (/i-a.s.). 
2.2. Specifications. 

Definition 2.1. A family of probability kernels 7 = {7a}ags is called 
a specification if 

a) Ja(F\ 4 ) is J 7 a? -measurable for all A G S and F G T ; 

b) ^ A {F\uS) = l F (u) for all AeS and F G T A c; 

c ) 7a'7a — 7a' "whenever A C A', and where 

(-1a^a){F\uj) = J j A {F\ri)jA'(dri\uj). 

Definition 2.2. ^4 probability measure // zs called consistent with a 
specification 7 f denoted by fx e G(j)) if for every bounded measurable 
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function f one has 
(2.1) / fdn= I 7a(/)^- 



If n is consistent with the specification 7, then 7 can be viewed as 
a version of the conditional probabilities of fi, since (|2.1|) implies that 
for any finite A 

ja{A\uj) = E At (I A |J r A c)(a;) (/i-a.s.), 

where JF AC is the cx-algebra generated by spins outside A. 

Definition 12 . 1 1 req uires that for every u, j\(-\u) is a probability mea- 
sure on JF, and that the consistency condition (c) is satisfied for all 
uj £ Vt. In fact, when dealing with the weakly Gibbs measures, these 
requirements are too strong. Definition 12. II can be generalized |15| p. 
16], and this form is probably more suitable for the weakly Gibbsian 
formalism. 

2.3. Construction of specifications. In [7J, Goldstein showed that 
every measure has a specification. In other words, for every measure 
/j, there exists a specification 7 in the sense of definition 12. 11 such that 
/i£ 5(7). Let us briefly recall Goldstein's construction. 

Suppose /i is a probability measure on Q = A 1 , and let {A n }, A n £ 
S, be an increasing sequence such that U n A n = Z d . For a finite set 
A £ Z d , and arbitrary 77 a £ A A , uj £ A z \ A , define 

//(77a|^a=) := MMI-^aOM; 

where [^a] = {C £ ^ : C|a = VA-}- By the martingale convergence 
theorem 

(2.2) MMI-^aOM = lim M^aI^a-Aa) for \i - a.e. uj. 

n— >oo 

The sequence on the right hand side of ()2.2|) is defined by elementary 
conditional probabilities: 

Mt7a^a„\a) 
M^a„\a) 
Define 



M*7a|wa»\a) 



(2.3) Ga = : the limit on RHS of (Q exists for all r/ A G fi A }, 
and denote this limit by p\(r)\u). For A C A n , let 

Qa" = £ G An : PAn^AWAnXAk) > 0}. 



Variational Principle for Generalized Gibbs Measures 5 



Finally, let 

c 

WA=un^ 



n j=n 

and define 7a by 



(2.4) 7 a(7?|u;) 



Theorem 2.3. The family 7 = {7a}ags given by \2.J$ is a specifica- 
tion, and fi E G(j)- 

Suppose 7 is a specification, and /1 is cosnsistent with 7. Therefore 
MVa c ) = 7a(-|<^a c ) fi-a.s. 
Taking (|2.2|) into account we conclude that 
(2.5) 7a(??a|^aO = lim n(r] A \uj An \ A ) 

n^oo 

for all r]\ and /i-almost all u. An important problem for establishing 
the variational principle for generalized Gibbs measures, is determining 
the set of configurations where the convergence in ()2.5j) takes place. 



3. Properly supported measures 

Theorem 3.1. Let \x be a measure consistent with the specification 7. 
Suppose that v is another measure such that h{y\\i) = 0. If for v-almost 
all uj 

(3-1) m(Ca|w a „\a) -> 7(6v|^A<0, 

then v is consistent with the specification 7, i.e., v G £(7). 

Remark 3.1. Note that by the dominated convergence theorem, con- 
vergence in ()3.1j) is also in L\{y). 

Remark 3.2. If /i is an almost Gibbs measure for the specification 7, 
and v is a measure concentrating on the points of continuity of 7, i.e., 
i/(f2 7 ) = 1, then ()3.1|) holds, see the proof below. 

Proof of Theorem \3.1\ Suppose that h{v\^i) = 0. Then jBJ Theorem 
15.37] for any e > and any finite set A, and every cube C such that 
ACC, there exists A, CCA such that 



£. 



(3.2) MI/a-/a\a|)< 
where for any finite set V, fy is the density of v\y with respect to 

JV{W V ) = — r. 
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Rewrite (|3.2j) as follows: 



A*(I/a — /a\a|) = ^2 Kvav^a) 



v(VaUa\a) v(ua\a) 
/ ,(^\v(vau A \a) Kvaua\a) \ 

) ¥a\a Ln s ? r r 



VA,"A\A 

, fv^l u ( VaUa\a) _ Mw^avQ 
WAVA „ A ("a\a) M^a\a) 

= ^ ^(w A \a)||i / a(Va\a) - /m(Va\a)||tv 

= E v ||v A (-|o; A \ A ) - //aH^avOIItv, 

where || • is the total variation norm. 

A measure v is consistent with the specification 7 if 

^a(Va<0 = 7a(Va<0, v-a.e., 

or, equivalent ly, 

E^IKH^v) - 7a(-|^a<=)IItv = 0. 

Obviously one has 

E„||z/ A (-|w A e) - 7a(-|w a «)IItv 

<E i ,||z/ A (-|^ Ac ) - z/ a (-|^a\a)||tv 

+ E i/ ||i/ a (Va\a) ~ ^a{-\^a\a)\\tv 
+ E !/ ||/x a (-|o;a\a) - 7a(Va0IItv- 
By the martingale convergence theorem, the first term 

E,|KOa\a) - M'MWtv 0, as A / Z d . 

The second term tends to due to (|3.2j) . and the third term tends to 
zero because of our assumptions. □ 

3.1. Weakly Gibbs measures which violate the Variational Prin- 
ciple. Disordered systems studied extensively by Kiilske jHUHl provide 
a counterexample to the variational principle for weakly Gibbs mea- 
sures. In [TU], Kiilske, Le Ny and Redig showed that there exist two 
weakly Gibbs measures \i~ such that 

(3.3) h(n + \n~) = h(n~\n + ) = 0, 

but /i + is not consistent with a (weakly) Gibbsian specification 7~ for 
fi~, and vice versa. The novelty and beauty of their Random Field Ising 
Model example lies in the fact that both measures are non-trivial and 
the relation in ()3.3|) is symmetric. As already mentioned above, pre- 
vious almost Gibbs and weakly Gibbs ^3] example violating the 
variational principle satisfied h(S\fi) = with 5 being a Dirac measure. 
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What happens in the situation when h{y\ix) = 0? Suppose v is 
consistent with the specification 7. Then 

EvII/mH^avO - 7a(-|^A c )||tv <Ej,||)Ua(-|^a\a) - va(-\u A \a)\\tv 

+ E^||z/ a (Va\a) - 7a(-|^a°)IItv- 

Again, the first term on the right hand side tends to zero because 
h(u\n) =0, and the second term tends to zero, because of the martingale 
convergence theorem. Therefore 

||/Ua(-|^a\a) - 7a(-|^a c )IItv — >■ 0, 

in Li(u), and since the total variation between any two measures is 
always bounded by 2, we have that 

Ma(-|^a\a) —> 7a(Va c )> v - a -s- 

Since 7 is a specification for u, and since the "infinite" conditional prob- 
ability for n are defined as the limits of finite conditional probabilities 
(provided the limits exist) we conclude that 

(3.4) fi A (-\u A c) = u A (-\u A c), u-a.s. 

In fact ()3.4j) was obtained in a different way earlier by Follmer in 
Theorem 3.8]. It means that h(y\jjk) = implies that the conditional 
probabilities of /1 coincide with the conditional probabilities of v for v- 
almost all u. However, as the counterexample of ^U] shows, this result 
is not suitable for establishing the variational principle for the weakly 
Gibbsian measures, because the conditional probabilities can converge 
to a "wrong" specification. Condition (|3.1|) is instrumental in ensuring 
that this does not happen. 

4. Almost Gibbs Measures 

In this section we show that the condition (|3.1|) holds for all measures 
v which are properly supported on the set of continuity points of almost 
Gibbs specifications. 

Definition 4.1. A specification 7 is continuous in uo, if for all A G S 

sup|7A(a A |u;A n \A77A=) - 1 'hip A \u Ac )| -^0, as n -> 00. 

<T,7J 

Denote by fi 7 the set of all continuity points 0/7. 

Definition 4.2. A measure \x is called almost Gibbs, if fi is consistent 
with a specification 7 and /i(fi 7 ) = 1. 
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Remark 4.1. Note that we define almost Gibbs measures by requiring 
only that the specification is continuous almost everywhere. We do not 
require (as it is usually done, see e.g. that the corresponding spec- 
ification is uniformly non-null, in other words satsfies a finite energy 
condition: for any A G S, there exist a\, b\ G (0, 1) such that 

a A < inf 7a(£a|^a c ) < sup 7a(£a|^a<0 < ^a- 
Theorem 4.3. If fi is an almost Gibbs measure for specification 7 ; and 

1/(0,) = 1, 

then 

(4.1) KU\vAn\A) -> 7(£a|wa«0 

for all £a an d v-almost all u. 

Proof. Since ^6^(7), jti satisfies the DLR equations for 7, and hence 
M£a^a„\a) = J lA n (£AU An \A\r) A c)n(dr)). 

Similarly 

(4.2) 

( , 1 v M£a^a„\a) ! lAMA^AnXA^A^^dT]) 

M£a|^a„\a) - 



Ef A KU^A n \\) E^ A / 7A„(a^A„\A|^A ?l )/i(^) 

Since 7 is a specification, for all £, u, r\ one has 

(A n\ 7A n (^A n \A|^A-) , t . v 

(4-3) — -~ = 7a (£a I ^A n \AVA^ ) ■ 

Lf A 7A n (U^A„\A|^) 

Let 

r n (w) = sup |7a(^a|^a„\a?7a-) - 7a(^a|^a0I- 

?A,»?A n \A 

Therefore, using (|4.3jl . we obtain the following estimate 

7a(£a|<^a c ) - ^n(^) < M6a|<^a„\a) < 7a(£a|^a c ) + r n (u). 
Since r n (o;) — > for u; G fi 7 , we also obtain that for those a; 

M£a|^a„\a) -> 7a(£a|^v)- 



□ 



Remark 4.2. In the case /x is a standard Gibbs measure and 7 is the 
corresponding specification, one has f2 7 = f2 and hence by repeating 
the proof of Theorem 14.31 one obtains (|3.1|) for all measures v. 
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5. Regular Points of Weakly Gibbs Measures 

As we have stressed above the crucial task consists in determining the 
regular (in the sense of ([3.1)1 ) points for fi. In this section we address 
this problem in the case of weak Gibbs measures. A weakly Gibbsian 
measure \i is a measure for which one can find a potential convergent 
on a set of [i- measure 1, but not everywhere convergent. 

Definition 5.1. Let fi be a probability measure on (f2,jF) ; and U = 
{[/(A, ■)} is an interaction. Then the measure /i is said to be weakly 
Gibbs for an interaction U if is consistent with 7^ (\i G G{l u ) ) and 

fi(£lu) — ^({^ '■ H1(lu) is convergent VA G S}) = 1. 

Is it natural to expect that the set of points Qu where the potential 
is convergent, coincides with the set of points regular in the sense of 
([3. ID )? The definition 15.11 is rather weak. It is not even clear whether 
in the case of weak Gibbs measures the following convergence holds: 
for any finite A, any £a, and /i-almost all o>, 77 

Ta(^a|^a„\a^) — ► TaCSaJwaO as A ™ T 
Note that ()5.1|) is a natural generalization of a characteristic property 
of almost Gibbs measures (see definition 14.1)1 . 

We suspect that ([5.1)1 does not hold for all weakly Gibbs measures. 
However, the counterexample should be rather pathalogical. Most of 
the weakly Gibbs measures known in the literature should be (are) 
intuitively weak Gibbs as well. 

We introduce a class of measures which satisfy (j5.1[) . 

Definition 5.2. A measure \x is called intuitively weakly Gibbs for 

an interaction U if fi is weakly Gibbs for U, and there exists a a set 
Vty 9 C Qjj with fi(Q T J 9 ) = 1 and such that 

7a(^aI w a„\a^a-) — >7a(£aI^a c ) 
as A n I Z d ; for all u, r} G Q r J 9 . 

This definition of "intuitively" weakly Gibbs measures is new. How- 
ever, it is very natural, and in fact, this is how the weakly Gibbs mea- 
sures have been viewed before by one of us, c.f. j2]: "... The fact that 
the constraints which act as points of discontinuity often involve config- 
urations which are very untypical for the measure under consideration, 
suggested a notion of almost Gibbsian or weakly Gibbsian measures. 
These are measures whose conditional probabilities are either contin- 
uous only on a set of full measure or can be written in terms of an 
interaction which is summable only on a set of full measure. Intu- 
itively, the difference is that in one case the "good" configurations can 
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shield off all influences from infinitely far away, and in the other case 
only almost all influences. 

The difference between the Gibbs, almost Gibbs, and intuitively 
weak Gibbs measures is that 

7a(^a|^a„\a^a ?1 ) — ► 7a(Ca|^Ac) as A n | Z d , 

holds 

• for all uj and all 77 (Gibbs measures); 

• for yU-almost all uj and all rj (almost Gibbs measures); 

• for yU-almost all uj and /i-almost all rj (intuitively weak Gibbs 
measures) ; 

A natural question is whether there exists an intuitively weak Gibbs 
which is not almost Gibbs. An answer is given by the following result. 

Theorem 5.3. Denote by G, AG, WG and IWG the classes of Gibbs, 
almost Gibbs, weakly Gibbs, and intuitively weakly Gibbs states, respec- 
tively. Then 

G C AG C IWG C WG. 

Proof of Theorem \5.?A The inclusion G C AG C WG was first estab- 
lished in The inclusion IWG C WG is obvious. In fact, the result 
of [12] implies AG C IWG as well. 

Let us now show that AG 7^ IWG. In [12J, an example has been 
provided of a weakly Gibbs measure, which is not almost Gibbs. This 
example is constructed as follows. Let Q = {0, 1} Z+ and fi is absolutely 
continuous with respect to the Bernoulli measure v = 5(1/2, 1/2) with 
the density / 

f(uj) = exp(-#»), 

where H u is a Hamiltonian for the interaction U, which is absolutely 
convergent //-almost everywhere. The interaction is defined as follows. 
Fix p < 1 and define 

(5.2) U([0,2n},oj) = t^2nP n ~ %nH W 

where 

N 2n (u) = max{j > 1 : w 2n w 2n -i . . . oJ 2n -j+i = 1} 



if oj 2n = 1 and N 2n = if u 2n = 0. Moreover, U(A, u) = if A ^ [0, 2n}. 
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It is easy to see that H u {uj) = J2 n>0 U([0, 2n], to) is convergent for 
z/-a.a. uj. However, H u (uj) is sufficiently divergent, so that the condi- 
tional probabilities 

Ufa = 0|afr ... Un ...) = 1 + eM _ H l {luJiUj2 0) 

are not continuous //-almost everywhere. Therefore, /i is not almost 
Gibbs. 

Nevertheless, the exists a set tt' C {0, 1} Z + such that /j(JI') = 1 and 
for every u;, £ G Q' 

H U (luJ c) = H U (lu}[ 1: 2n]U[0,2n]c), H U (l^[l,2n]£[0,2n] c ) < OO, 

and 

-^^(l^Il^nl^O^njO — >• #^(l^[l,2n]^[0,2ra] c )> n ^ OO. 

Define 

B k = {i]e {0, 1} Z + : i]2kV2k-i ■ ■ ■ V[3k/2] = 1}, 

b = n u 

xeNfc>x 

Clearly, 

f(5 fc ) < 2- fe/2 , and i/(S fc ) < oo. 

k 

Hence, by the Borel-Cantelli lemma v(B) = and since A*(-B) = 

0. 

Moreover, for every uj G B c , H u (oo) < oo. Let Q' = B c and consider 
arbitrary uj, £ G Q'. Then 

|-H' E/ (l^[l,2n]C[0,2Ti] c ) - f^(l^[l,oo))| 

= \^ U([0, 2p], lu [lM i [GM c) - U([0, 2p\, lu [1>oo) ) | 

p>0 

(5.3) < J2 |^([0,2p],l^ [li2n] e[ ,2n]0-^([0,2p],l^ liOo) )| 

p>n+l 

< U([0,2p],lu ll , 2n] S [oM c)+ U([0,2p],lu lltOo) ) 

p>n+l p>n+l 

The sum ^2 p>n+1 U([0, 2p\, lu[i >O0 )) converges to zero as n — > oo since 
it is a remainder of a convergent series for H u (lu>[i j00 -)) . To complete 
the proof we have to show that 

(5.4) S:= U([0,2p],luj [lt2n] ^ 2< ) 

p>n+l 
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converges to as well. 

For every rj £ Q' = B c there exists K = K(r)) such that 

V2kV2k-l ■ ■ ■ V[3k/2] = 

for all k > K. Let K x = K(lu),K 2 = K(£) and K = max(K 1 ,K 2 ). 
Suppose n > K. Write the sum for S in (|5.4|) as Si + S 2 , where 

[4n/3]+l oo 

* = £ • s 2= E • 

p=n+l p=[4n/3]+2 

Let us estimate the second sum first. Since p > n > max(Ki, K 2 ) > 
K(£), we have that 

£,2p£,2p-l • • • £[3p/2] — 0- 

Moreover, since p > [4n/3] + 2>4n/3 + l, one has [3p/2] > 2n and 
therefore {7([0, 2p], l^[i,2n]£[o,2n] c ) does not depend on uj[\^n]- Hence 

oo 

S 2 = ^ ^([°> 2 P]>l^[i,oo]) -»• 0, as n -»• oo. 

p=[4n/3]+2 

Let us now consider the first sum 

[4n/3]+l 

Si = X] ^(t ' 2 ^]' lu; [l,2n]^[0,2n]0- 
p=n+l 

Terms in S±, in principle, depend on u)h 2n ]. For this one has to have 
that 

^p&p-l • • • iin+i = 1, 
and few of the last bits in uj[\ t2n ] are also equal to 1. Suppose uj t = 
. . . = oj 2n = 1. Note, however, that since n > max(Kx, K 2 ) > K(u), 
necessarily t > [3n/2]. Therefore, 

U([0, 2 P ), l^fo*].) < f?-^-^ < p 3n/2_ P -2 < p 3n/2-4n/3-3 = ^/6-3 > 

and since p < 1 

Si < np n/6 ~ 3 -> 0. 

□ 

Another example of an intuitively weakly Gibbs measure, which is 
not almost Gibbs, is the finite absolutely continuous invariant measure 
of the Manneville-Pomeau map j!4j . The reason is that for every u 

7a(^0 = l|^[l,n]0[„+i,oo)) = 0, 

where is a configuration made entirely from zeros. Thus the config- 
urations finishing with an infinite number of zeros, are the bad config- 
urations, causing the discontinuities in conditional probabilities. (One 
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can also show that there are no other such configurations.) Since there 
is at most a countable number of bad configurations, this set has a 
/x-measure equal to 0. 

Yet another example of a measure which should be IWG is the re- 
striction to a layer of an Ising Gibbs measure. 

The reason for this is the following. The example considered above 
in Theorem 15.31 the absolutely continuous invariant measure for the 
Manneville-Pomeau map, and the restriction of an Ising model to a 
layer, have a very similar property in common. Namely, for every 
"good" configuration to, there is a finite number c = c(u) such that 
|t/(A uj)\ starts to decay exponentially fast in diam(A) as soon as 
diam(A) > c(u). In the example above, c(u) = K(u). In fact, the 
"good" configurations are characterized by the property that c(u) < 
oo. This random variable c(u) was called a correlation length. The 
main difficulty is in estimating the correlation length c(C, A u An \ A 7] A n) f° r 
the "glued" configuration £a^a„\a?7ac in terms of correlations lengths 
c(^\co\c) and c(^a^a c )- Estimates obtained in [T3| should provide enough 
information to deal with this problem in case of the restriction of the 
Ising model to a layer. 

Let us proceed further with the study of regular points of an intu- 
itively weak Gibbs measure //. We follow the proof of Theorem 14.31 
Firstly, one has 

fr K \ 7A„(£A^A n \A|7?A=) ft i x 

(5-5) — -~ ■ = 7aUa|^a„\a^aJ- 



E £ - a 7a„(^a„\a|^ 



Let 



r n(v) = sup^a^aI^a^a^) - Ta(Ca|w a «)|- 
€a 

Since 7 is a specification, \r n \ < 2. Moreover, since /x is intuitively 
weak Gibbs, then for uj e Q r J 9 , r^(rj) — > for //-almost all 77. Fix e > 
and let 



A"={v-K(v)\>e}- 



Then 



KU^A n \A) = 7a(£a|wa<0M w a„\a))+ / r^(rj)y2^ An (i A u An \ A \r] A c)fi(drj) 

Jn 

?A 
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and we continue 

M6v^a„\a) 



M^a„\a) 



(5.6) 



7a(£a|<^a c ) 



< e + 2 



/ E| A 7a„(^a„\a|?7a- )m W 

E ? - A 7A„(a^A„\A|% ?l )/i(^) 



M^a„\a) 

where we used that |r^| is always bounded by 2. 
Let us estimate the remaining integral 



/ y]7A„(lA^A„\A|?7Ac)K^) = / I ^ n (^)(7Ajc A?l \ A )(^)/i(^), 

w a„\a}- 

) 



where Iw An \ A is the indicator of the cylinder set {( : Ca„\a 
The set Af is JF Ac -measurable, therefore 



I^e.n (7A n W\ A ) = 7A n (lA- -I 



and since /i satisfies the DLR equations with 7, we obtain that 

/ ^7An(lAk>An\A|»7A«M*7) = ^JV^A^iv)^) 

= M^,n nw A ?l \A)- 

Therefore, we obtain the following estimate 



- 7a(£a|^a c 



mOa„\a) 
ow 

/i(^ A „ AA n A£ n ) 



< e + 2 



M w A r 



\a; 



Now, if we can show that for all u; G 



m(^a„\a) 



0, as A„ t Z a 



we will be able to conclude that all points Q r J 9 are regular in the sense 

offnj). 

Let us now turn to the example of an intuitively weak Gibbs measure 
considered in Theorem 15 .31 As usual for the Gibbs formalism, we check 
the required property only for A = {0}. We also let A n = [0, n), hence 
A n \A=[l,n]. 

Let uj G Q T J g . Hence the potential is convergent in lo^[i,oo)> and 
therefore exp(— H(l u\i )OO ^)) > 0. Choose arbitrary e > such that 



e < -exp(-H(l Q U[ 1)00 y)). 
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The measure \x is absolutely continuous with respect to the Bernoulli 
measure v = -8(1/2, 1/2). First of all, let us show that 



(5.7) 
implies 

(5.8) 



v{u [lM n A» n ) 



0, as n — > oo, 



0, as n — > oo, 



Since i?(C) is non-negative (possibly infinite) for any £, one has 



< I ^C) = K^\An< n ). 



^A„\An^ ri 



Consider the set W = 0J\i, n ] H (Aj? ) c . For every 77 e W we have 



sup 

?0 



7o(£ok[l,n]?7[n+l,oo)) - 7o(6l^[l,oo)) 



< e. 



In particular 



exp(-if(l o;[i, n ]77[ n+ i i oo))) exp(-if(l a;[i, oo ))) 



exp(-if(l W[i, n ]?7[n+i,oo))) + 1 exp(-fZ"(l W[i >00 ))) + 1 



<£, 



where we have used the fact that i?(0 C[i,oo)) = for all (. Note 
also that since u, rj e ti^ 9 , both H(l U[ ljOO )), #(lo<^[i,n]?7[n+i,oo)) are 
non-negative and finite. Therefore 



exp(-if (l W[i, n ]?7[ n+ i,oo))) - exp(-fZ"(loW[i,oo))) 



< 4e. 
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Hence, 



C) c ) 



= /i(i„ n w [ljn ] n « n ) c ) + /i(o n w [ljn ] n (A U E 

J exp(-H(C))u(dO 

i nu [hn] n(A£ n ) c 

+ j ex P (-H(C))v(d() 

o nu> ll)n] r\(A% tn y 

> (exp(-#(l u; [liOo) ) ~ 4e)i/(l n w [1>ft] n « >n ) c ) 

+ z/(o n^ [lin] n(^ n ) c ) 

> CV(u, [1)n] n «J C ), 

where C = exp(— i?(l a;[i )OO )) — 4e > (note that C < 1). Therefore, 

ti(u M n Ai n ) < c _ x p{uj [1m n A-J 



M W [l,n]) 



c- 



Z/(a>[l,n]) 1 - u H^] nA e,n) 
"{U \, n ) 



and hence (|5.7|l indeed implies (|5.8jl . 

Let us now proceed with the proof of (|5.7j) . Since i/ is a symmetric 
Bernoulli measure, ^(wri^]) = 2~ n . 

If x, y > then 



< |x — y\. 



l + e~ x 1 + e~f 
Therefore, if rj e i.e., 



1 + e- 



1 + e~» 



sup 



To(^o|^[l,n]?7[n+l,oo)) _ 7o(^o|^[l,oo)) 



> e, 



then 
(5.9) 



-^(lo^[l,n]^[n+l,oo)) — #(lo^[l,oo)) 



> £. 



Hence, if we define B^ n as a set of points 77 such that ()5.9j) holds, we 
get that Al n C fl£ n . 

To estimate the measure of I?^ n we have to use the estimates from 
the proof of Theorem 15.31 Without loss of generality we may assume 
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that n is even, n = 2n' . Let us recall the estimate (|5.3|) 

\H(lo^[l,2n']V[2n'+l,oo)) ~ H (l U[i jOO ))\ 

0^[l,oo) / 

p>n+l p>n'+l 

The second sum on the right hand side does not depend on 77, and 
converges to as n' — > oo. Therefore, by choosing n! large enough we 
must have that if rj G B" n then 

^ U([0,2p], lo^[l,2n']^[2n'+l,oo)) > |- 
p>n+l 

Let us define a sequence <5 P = p 01p , p > 1. Since p G (0,1), for 
sufficiently large n' one has 

E ^ < % 

p>n'+l 

Consider the following events, 



Cp = |r/ : U([0,2p], l ^[l,2n']^[2n'+l,oo)) > 5p 

Obviously, 

p>n'+l 

In general, for arbitrary (, U([0, 2p], () > 5 P if (see (15. 2|) ) C = ( 2p 
N 2p (C) < V and and pP~ N ^0 > p o.i P _ Therefore, 

0.9p < N 2p {() < p, 

and hence 



p 



v((:U([0,2p],0>S P )< E 2- fc < 2-°* +2 =: V 

fc=[0.9p] 

Let us continue with estimating the probability of C p . If p > 2n ; , then 
does not depend on u>, and hence using the previous estimate 

For the small values of p, p G [n' + 1, 2n'], we have to proceed differ- 
ently For such p's the configuration rj can "profit" from the last bits 
(equal to 1) in uj. Since u is a regular configuration (see Theorem 15. 3|) . 
for sufficiently large n', uj G G n >, where 

G n > = {C : (2k ■ ■ ■ C[3fe/2] = VA;>n'}. 
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In particular, it means that at most n'/2 + 1 of the last bits in cj[ 12n '] 
are equal to 1, and in the worst case, o; 2 n' • • • w [3n'/2]+i = 1- From now 
on we assume that u>2n' ■ ■ ■ w [3n'/2]+i = 1- 
We split the set of "bad" r/'s as follows: 

C p = {V -U([0,2p], l W[l,2n']»7[2n'+l 1 oo)) > $p} 

= {l] : C/([0,2p], lt)W[l,2n']»7[2n'+l,oo)) > $p & %n'+l ■ ■ ■ V2p = 0}U 
{?? : C/([0,2p], l W[l,2n']»7[2n'+l 1 oo)) > S p kr] 2n '+i ■ ■ -r]2p = 1} 

= c;'°uc;- 1 . 

Again, the set C^' does not depend otiw. In fact, ,0 is not empty 
only for p's close to 2n': on one hand, p — N 2p < O.lp and on the 
other, A" 2p < 2p — 2n'. Together with the fact that p £ (n',2n'], this 

/ 2 i 

is possible only for p £ v YT n '' ' an ^ ^ * n ^ S ^ n ^ erva ^' one 
would need more than 0.9p ones, hence making a //-measure of C£ ,0 
sufficiently small: 

i/(C£'°) < 2~ a9p . 

Finally, the elements of C^' 1 are precisely the configurations which 
can profit from the fact that the last few bits in tou^n 1 ] are equal to 
1. For such 77's, in a "glued" configuration ( = 1o^[i,2ti']^[2ti'+i,oo) a 
continuous interval of l's is located starting from position [3n'/2] + 1 
and finishing at position 2p. In order to have a positive contribution 
from U([0, 2p\, () a long run of l's should not be too long. Namely, 



N2 P (C) = 2p 



3ra' 



< 



implying that p < [3n'/2], and hence, C^' 1 is empty for p > [3n'/2]. 
For, p £ [n' + 1, [3n'/2]] one has 

U([0,2p],() = P p - N2p(0 = P [3n ' /2] - p - 

Hence if C/([0,2p],C) > p 0Ap , then [3ra'/2] - p < O.lp and hence p > 
[3n'/2]/l.l. Once again that means that C^' 1 is empty for p £ [n' + 
l,[3n'/2]/l.l-l]. 
Therefore, for pG [n' + 1, 2n'] we conclude that 



C^' 1 C {?? : i]2n'+i = ■ ■ ■ = V2 P = 1} if 
and C^' 1 = 0, otherwise. In any case, 

HCp' 1 ) < 2~ 2p+2n '. 



1 

LI 



[?>n'- 




r3n'i 


L 2 . 


< P < 


I 2 J 
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We obtained that 

< v(u[i,2n>]nc%) = s 1 + s 2 + s 3 + s 4 , 

p>n'+l 

where Si, S2, S3, S4 are sums over integer p's in intervals Ii = [n 1 + 
l,[3n'/2]/l.l), h = [[3n'/2]/l.l, [3n'/2]], J 3 = [[3n'/2] + 1, 2n'], and 
I4 = [277,' + 1, 00), respectively. We have the following estimates 

Si = "Hi,**] n C?) = ^ z/(^ [li2n1 n C£<°) 

pe/i peli 

o-0.9n' 

9~2n'_f ^ o o-2. 9n' 

~ 1 _ 2-0-9 ^ 6 ■ 1 

s 2 = J2 K^w] n c->°) + ^ 1/(^1,^] n cf) 
pe/2 pe/2 

^ 2~ 2n ' 2 _0 - 9 p 4- 2~ 2n ' 2 _2 P+ 2n ' 
pe/2 pG/2 

n„/ 0.9 3rc/ 2 3n' 

< T 2n ■ 12 ■ 2~~'~ + 12 ■ TTX-— 

< 12 ■ 2~ 3 ™' + 12 • 2~ 2 - 7n ' < 12 ■ 2~ 2 ' 7 ™'; 

= 2- 2n ' ^ 2-°- 9p < 2- 2n ' • 12 ■ 2"^ < 12 ■ 2" 3 "'; 
pe/3 

5 4 = ^zy(^ [lj2n , ] nL7-)<12-2- 3 - 8n '. 
pe/4 

Finally, we conclude that 

Kajljnj n J ^ + g 2 + g 3 + g 4 

; — : — < —. > as n — > 00. 

K^[l,2n']) " 2- 2 "' 

To summarize our result, we formulate the following theorem. 

Theorem 5.4. Let \i be the (intuitively) weak Gibbs measure, but not 
almost Gibbs, discussed above in Theorem \5.fy and which has been in- 
troduced in fTiy . Then there exists a set fl' such that /x(fi') = 1 and 
the following holds: 

• the potential U is absolutely convergent on fl' ; 



20 



A. CD. van Enter, E.A. Verbitskiy 



• for all uj, rj G fl', any finite A and all £a G one has 

^a((awa„\a^) — > H a (£ a uac), 

7a(^a|^a„\a^) -» 7a(£a|^a<0, 
as A n — > Z_|_. 

• every u E Q' is regular in (Goldstein's) sense: for every £a 

M£a|^a«\a) -> Ta(^a|wa«), 

as A n — ► Z_|_. 

6. Bit-shift channel 

A somewhat different kind of non-Gibbsian example comes from an 
industrial application: data storage on magnetic tape or optical disks 
(CD, DVD, etc). Before formulating the model precisely, let us explain 
the mechanism which leads to a non-Gibbsian measure. 

The medium for magnetic or optical data storage can be in one of the 
two states: "high" and "low" , or "bright" and "dark" . The information 
is encoded not in the state of the medium itself, but in transitions 
between these states, and more precisely, in "units of time" between 
two successive transitions. In the following table the first line indicates 
the state of the medium (H(igh) or L(ow)) and the second line indicates 
the corresponding occurrence (1) or absence (0) of transitions: 

... LHHHHLLLHHHHL ... 
... 1 1 1 1 ... 

An equivalent way to represent the second line is to record the number 
of zeros between consecutive ones. In the case above, one obtains a 
sequence (. . . , 3, 2, 3, . . .). For technical reasons, in data storage one 
often uses coding schemes such that the transitions are never too close, 
but also not too far away from each other. This is achieved by using 
the so-called run-length constrained codes. 

When the magnetic medium or optical disk are read, due to various 
effects like noise, intersymbol interference or clock jittering, the transi- 
tions can be erroneously identified, thus producing a time-shift in the 
detected positions. 

Suppose in the example above the following error has occurred: 
the second transition has been detected one time unit too late. The 
resulting sequence then is (...,1,0,0,0,0,1,0,1,0,0,0,1,...). And 
the corresponding representation in terms of runs of zeros will be 
(. . . , 4, 1, 3, . . .) instead of (. . . , 3, 2, 3, . . .). 

The following description of a bit shift channel is due to Shamai and 
Zehavi, [T5] . 
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Let A = {d, . . . , k}, where d, k G N, d < k and d > 2. Define 
X = A z = {x = (xi) : Xi G A}, Q = {-1, 0, 1} Z = = (wi) : w< G 
{ — 1, 0, 1}}. Consider the following transformation if defined on X xQ 
as follows: y = ip(x,uj) with 

Hi = Xi + uJi — 0Ji^\ for all j G Z. 

Note that y is a sequence such that y^ G {0, . . . , + 2} for alH, but 
not every sequence in {0, . . . , k + 2} z can be obtained as an image of 
some x G X, u G Vt. For example, all image sequences y = ip(x, d) 
cannot contain 00. Indeed, suppose y% = for some i. This is possible 
if and only if Xi = 2, UJi = —1, and = 1. But then y i+ i = 

x i+1 + uj i+ i -Wi>2-1 + 1 = 2. 

Since ip is a continuous (in the product topology) transformation the 
set Y = <f(X x ft) is a so-called sofic shift, see [TT] . 

Suppose /i and n are product Bernoulli measure on X and f2 with 

Kj) = Pji j = d,...,k, 7r(— 1) = 7r(l) = e, 7r(0) = 1 - 2e. 

The measure /x describes the source of information and 7r describes the 
jitter (noise). 

Let v = (/i X 7r) oyj -1 be a corresponding factor measure on Y defined 

by 

v(C) := (/i x 7r)((/? _1 C) for any Borel measurable A CY. 

Despite the fact that some configurations are forbidden in Y, in other 
words, we have some "hard-core" constraints, there is a rich theory of 
Gibbs measures for sofic subshifts. One of the equivalent ways to define 
Gibbs measures is as follows. We say that an invariant measure p on 
Y is Gibbs for a Holder continuous function if : Y — > K and constants 
P and C > 1 such that for any y G Y one has 

C 7^ nr~\ — r~T7TpY - ' 

ex P(E fc =o <PK<r*y) l)P) 

where a : Y — * Y is the left shift. The function ip is often called a 
potential, and has a role analogous to that of fu(-) = J2oeA 
for standard lattice systems. The constant P in (J6.1|) is in fact the 
pressure of 

Now, often called the Bowen-Gibbs property, implies that for 

all y G y 

(6.2) - Ci < logp(y |2/i, • • • , Vn) < <p(y) + Ci, 

for some positive constant C\. Since Y is compact, and ip is contin- 
uous, we conclude that for every y and all n G N the logarithm of 
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the conditional probability p(yo\yi, ■ ■ ■ , y n ) is bounded from below and 
above. 

It turns out that v = (fi x tt) o ip~ l is not Gibbs. As usual in the 
study of non-Gibbsianity we have to indicate a bad configuration. In 
our case, configuration 02°° is a bad configuration for v. Consider 
cylinder [y , . . . , y n ] where 

l/o = 0, y 1 = ... = y n = 2 

Then effectively there is a unique preimage of this cylinder. Indeed 
yo = 0, and as we have seen above, this is possible only for 

Xq = 2, uj = —1, cj_i = 1. 

For the next position i — 1 we have 

2 = yx — X\ + U\ — luo — x\ + lo\ + 1. 

Again, since uj\ + 1 > and X\ > 2, this is possible if and only if 
l)\ = —1 and xi = 2. But then x<i = 2 and uj 2 = —1, and so on. 
Therefore 

^([0,2, 2,-^2]) C g^.^JJ x [-1,-1,..., -lj , 



n times n+1 times n+1 times 



and hence 

z/([0, 2, 2, . . . , 2]) < /i([2, 2, . . . , 2])tt([-1, -1, . . . , -1]) = (p 2 e) n+1 . 

On the other hand, cylinder [2,2,..., 2] has many preimages. For ex- 
ample, with appropriate choice of cu's cylinders of the form 

[x u ...,x n ] = [2,. ..,2,3,2,... ,2] CX 

will project into [2, . . . , 2]. Indeed, if j is the position of 3 in [x±, . . . , x n ], 
then the choice ujq = u>x — . . . = uij-i = 0, and Uj = ujj + i = . . . = u n = 
— 1 will suffice. Therefore 

n 
3=1 

and for e < 1/3, one has 

u([2,2,...,2])>n P r 1 P3 e n+ \ 

and therefore 

/ , s Kf0,2, 2,..., 21) C 

u(0 2, 2, .... 2) = u /r ' ^ < -, 

M ' ' ^ i/([2,2,...,2]) " n 

and thus the logarithm of i/(0|2, 2, . . . , 2) is not uniformly bounded 
from below, and hence there is no Holder continuous ip such that (|6.2j) 
is valid for z/, and hence, z/ is not Gibbs. 



z/([2,2,...,2])>^ J 9r 1 p 3 (l-2e)V 
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A slightly more accurate analysis shows that v is not Gibbs for e > 
1/3 as well. 

An interesting open problem is the computation of the capacity of the 
bit-shift channel with a fixed jitter measure tt. This problem reduces 
to the computation of the entropy of the transformed measure v for an 
arbitrary input measure \x. In jTj an efficient algorithm was proposed for 
Bernoulli measures //. This algorithm produces accurate (to arbitrary 
precision) numerical lower and upper bounds on the entropy of v. 

7. Discussion 

In this paper we addressed the problem of finding sufficient condi- 
tions under which h{v\ix) = implies that v is consistent with a given 
specification 7 for \i. In particular, the question is interesting in the 
case of an almost or a weakly Gibbs measure \i. Intuition developed 
in [UUmE] shows that v must be concentrated on a set of "good" 
configurations for measure \i. In the case \i is almost Gibbs, v must 
be concentrated on the continuity points f2 7 , [10 . A natural general- 
ization to the case of a weakly Gibbs measure /i for potential U would 
be to assume that v is concentrated on the convergence points of the 
Hamiltonian H u . However, this is not true as the counterexample of 
[TU] shows. 

We weakened and generalized the conditions under which we can 
prove the first part of the Variational Principle. Moreover, we intro- 
duced the class of Intuitively Weak Gibbs measures, which is strictly 
larger than the almost Gibbs class, but contained in the Weak Gibbs 
class. 

The example considered in this paper shows (and we conjecture the 
same type of behaviour for other interesting examples of weakly Gibbs 
measures) that some weak Gibbs measures are more regular than was 
thought before. 
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